(54) Title: NUCLEIC ACIDS, PROTEINS, AND ANTIBODIES

(57) Abstract: The present invention relates to novel immune/hematopoietic-related polynucleotides and the polypeptides encoded 
by these polynucleotides herein collectively known as "immune/hematopoietic antigens", and the use of such immune/hematopoietic
antigens for detecting immune/hematopoietic-related diseases and/or disorders, particularly the presence of cancer and cancer
metastases of cells of hematopoietic origin. More specifically, isolated immune/hematopoietic associated nucleic acid molecules are 
provided encoding novel immune/hematopoietic associated polypeptides. Novel immune/hematopoietic polypeptides and antibodies
that bind to these polypeptides are provided. Also provided are vectors, host cells, and recombinant and synthetic methods for
producing human immune/hematopoietic associated polynucleotides and/or polypeptides. The invention further relates to diagnostic 
and therapeutic methods useful for diagnosing, treating, preventing and/or prognosing disorders related to the immune system or cells 
and tissues associated with hematopoiesis, including cancers of cells of hematopoietic origin, and therapeutic methods for treating 
such disorders. The invention further relates to screening methods for identifying agonists and antagonists of polynucleotides and 
polypeptides of the invention. The present invention further relates to methods and/or compositions for inhibiting the production and 
function of the polypeptides of the present invention. 
Nucleic Acids, Proteins, and Antibodies

[001] This application refers to a "Sequence Listing" that is provided only on 
electronic media in computer readable form pursuant to Administrative Instructions 
Section 801(a)(i). The Sequence Listing forms a part of this description pursuant to 
Rule 5.2 and Administrative Instructions Sections 801 to 806, and is hereby 
incorporated in its entirety. 

[002] The Sequence Listing is provided as an electronic file (PC004PCT_seqList.txt, 
76,977,474 bytes in size, created on January 16, 2001) on four identical compact discs 
(CD-R), labeled "COPY 1," "COPY 2," "COPY 3," and "CRF." The Sequence
Listing complies with Annex C of the Administrative Instructions, and may be viewed, 
for example, on an IBM-PC machine running the MS-Windows operating system by 
using the V viewer software, version 2000 (see World Wide Web URL: 
http://www.fileviewer.com). 

Field of the Invention 

[003] The present invention relates to novel immune system and hematopoietic 
related (herein "immune/hematopoietic") polynucleotides, the polypeptides encoded
by these polynucleotides herein collectively referred to as "immune/hematopoietic
antigens," and antibodies that immunospecifically bind these polypeptides, and the use
of such immune/hematopoietic polynucleotides, antigens, and antibodies for detecting,
treating, preventing and/or prognosing disorders of the immune system, including, but
not limited to, the presence of cancer and cancer metastases of cells of hematopoietic
origin. More specifically, isolated immune/hematopoietic nucleic acid molecules are
provided encoding novel immune/hematopoietic polypeptides. Novel
immune/hematopoietic polypeptides and antibodies that bind to these polypeptides are
provided. Also provided are vectors, host cells, and recombinant and synthetic 
methods for producing human immune/hematopoietic polynucleotides, polypeptides, 
and/or antibodies. The invention further relates to diagnostic and therapeutic methods 
useful for diagnosing, treating, preventing and/or prognosing disorders related to 
hematopoiesis and the immune system, including cancers of cells of hematopoietic 
origins, and therapeutic methods for treating such disorders. The invention further 
relates to screening methods for identifying agonists and antagonists of 
polynucleotides and polypeptides of the invention. The invention further relates to 
methods and/or compositions for inhibiting or promoting the production and/or 
function of the polypeptides of the invention.

Background of the Invention 

[004] The immune system is an intricate network composed of cells, tissues and 
soluble substances that function to protect the body from invasion by foreign 
substances and pathogens. The major cells of the immune system are white blood 
cells, including lymphocytes, such as B cells and T cells, and myeloid cells, such as 
basophils, eosinophils, neutrophils, mast cells, monocytes, macrophages and dendritic 
cells. The soluble components of the immune system are molecules (often
polypeptides) that are not contained within cells, but rather are found in extracellular 
fluids such as lymph and blood plasma. Some of the major soluble substances are 
antibodies, complement proteins, and cytokines. 

[005] Cells of the immune system (as well as red blood cells and platelets) are 
derived from a common precursor stem cell by a process known as hematopoiesis. 
During fetal life hematopoiesis occurs in the liver and spleen, but in the adult, 
hematopoiesis occurs mainly in bone marrow. The stem cells from which all blood 
cells are derived proliferate and differentiate into the various blood cell lineages (e.g.,
lymphocytes (B or T cells), myeloid cells (basophils, eosinophils, neutrophils, mast 
cells, macrophages), platelets, or red blood cells) in response to signals received from 
other cells (e.g., stromal cells) in the bone marrow microenvironment and also from 
cytokines. Many of the cytokines that promote the growth and differentiation of 
hematopoietic stem cells are known as "colony stimulating factors". For example,
interleukin-3 (IL-3, also known as multi-colony stimulating factor) and
granulocyte macrophage colony stimulating factor (GM-CSF), which are released by
activated macrophages and T cells, stimulate the production of macrophages and
granulocytes (myelopoiesis). Stem cell factor (SCF, c-kit ligand) is a growth factor for
primitive lymphoid and myeloid hematopoietic bone marrow progenitor cells
expressing the early cell surface marker CD34. Other hematopoietic cytokines/growth
factors include, but are not limited to, macrophage colony stimulating factor (M-CSF)
and granulocyte colony stimulating factor (G-CSF). Interleukins-1, 6, and 7 have also 
been shown to function as hematopoietic growth factors/cytokines. 

[006] The maturation of lymphocytes has an added layer of complexity in that each 
individual T and B cell generates a unique antigen specific receptor - a B cell receptor 
(antibody) in the case of B cells or a T cell receptor in the case of T cells. Because it
is possible that B and T cells may generate autoreactive antigen receptors, B and T 
cells undergo negative selection processes that eliminate autoreactive lymphocytes 
from the circulating pool of mature lymphocytes. Defects in negative selection may 
contribute to the occurrence of autoimmune disease. In addition, T cells undergo a 
process of positive selection in which T cells are selected for their ability to interact 
with the major histocompatibility antigens. In the thymus, T cells also differentiate 
into one of two classes, CD4+ T helper (Th) cells or CD8+ cytotoxic T cells. The 
majority of the maturation and selection process occurs in the bone marrow for B
cells, whereas T cell progenitor cells migrate from the bone marrow to the thymus
where they complete their maturation.

[007] Cells of the immune system circulate throughout the body in both the lymph
and the blood. Immune cells will leave the circulatory system and enter the tissues by 
a process known as diapedesis. Immune cells return to the circulatory system via 
travel in the lymph. Situated along the lymphatic vessels are lymph nodes, which are 
small nodular aggregates of lymphoid tissues. The architecture of the lymph node is 
designed to facilitate acquired immune responses, with antigen presenting cells, B 
cells and T cells all in close proximity. Antigen presenting cells (APCs, e.g., dendritic 
cells, macrophages, B cells) display antigen on their surface in the form of peptides 
associated with MHC class II molecules to T helper cells. T helper cells with T-cell
receptors specific for the given antigen become activated if they bind to the peptide
MHC complexes and receive co-stimulatory signals (e.g., stimulation of CD28 on the
T cell by B7 molecules on the APC). Activated T helper cells proliferate, secrete
cytokines, and can stimulate antigen-specific B cells or T cells to become activated.
Once activated, cytotoxic T cells proliferate and are able to induce apoptosis of cells
expressing specific antigen on their surface as a peptide in the context of MHC Class I
molecules. Activated B cells also proliferate and may either enter a germinal center
and undergo a process of affinity maturation of their antigen receptor, or differentiate
into antibody forming cells (plasma cells) that secrete large quantities of
antigen-specific antibody.

[008] Aside from lymphocytes and antigen presenting cells, introduced above, there 
are several other accessory cells in the immune system including neutrophils, 
eosinophils, basophils, mast cells, and Natural Killer (NK) cells. NK cells are large 
granular lymphocytes that have cytotoxic function, especially against cells infected 
with intracellular pathogens, and may function in the eradication of cancer cells. 
Neutrophils are phagocytic cells that play a key role in the inflammatory process. 
Activated mast cells release granules containing histamine and other active agents 
which are effective against large parasites and also contribute to allergic reactions and 
asthma. Eosinophils bear Fc receptors for IgG and IgE, and participate in the killing 
of antibody coated parasites. 

[009] The immune system can be classified into the acquired and innate immune 
system. The cells of the innate immune system (e.g., neutrophils, eosinophils, 
basophils, mast cells) are not antigen specific and their action is not enhanced by 
repeated exposure to the same antigen. The cells of the acquired immune system (B 
and T cells) are antigen specific and repeated exposure of B and T cells to an antigen 
results in improved immune responses (memory responses) produced by these cell
types. The cells and products of the acquired immune system can function to focus the 
action of the innate immune system. For example, eosinophils are not in themselves 
antigen specific, but as a result of expression of Fc receptors on their surface, their 
activity can be focused on a specific antigen to which an antibody response has been 
made by the acquired immune system. For a more extensive review of the immune 
system, see Fundamental Immunology, 4th edition, ed. William Paul,
Lippincott-Raven Pub. (1998).

[010] As illustrated above, an immune response is seldom carried out by a single cell 
type, but rather requires the coordinated efforts of several cell types. In order to 
coordinate an immune response, it is necessary that cells of the immune system 
communicate with each other and with other cells of the body. Communication 
between cells may be made by cell-cell contact, between membrane bound molecules 
on each cell, or by the interaction of soluble components of the immune system with 
cellular receptors. Usually, such receptors are embedded in the plasma membrane, but 
there also exists a subset of cytoplasmic and nuclear receptors. Communication, or
signaling, between cell types may have one or more of a variety of consequences
including activation, proliferation, differentiation, or apoptosis. Activation and
differentiation may result in the expression or secretion of polypeptides, or other 
molecules, which in turn affect the function of other cells and/or molecules of the 
immune system. 

[011] Signaling molecules of the immune system, including not only cellular
receptors and ligands, but also the downstream effectors of the receptors and/or 
ligands, may be described as immunomodulators. In addition, immunomodulators 
(also known as biological response modifiers) include microbial or synthetic 
substances and products of activated cells. The mechanism of action of 
immunomodulators usually involves a complicated interplay of various regulator and 
effector systems. Immunomodulators may enhance (immunoprophylaxis, 
immunostimulation), restore (immunosubstitution, immunorestoration) or suppress 
(immunosuppression, immunodeviation) immunological functions or activities. 
Immunomodulators may be, for example, cytokines, cytokine receptors, inhibitors of 
DNA synthesis, intracellular receptors, or components of signal transduction pathways,
some of which are described in more detail below: 

Cytokines and Cytokine Receptors 

[012] Cytokines are small soluble proteins produced by one cell that alter the 
behavior or other properties of another cell or itself. Thus, by definition, cytokines are 
immunomodulatory molecules. Many cytokines have multiple biological effects and 
are critical to the regulation of the immune response. For a review on cytokines, refer
to Chapter 11 of Cellular and Molecular Immunology by Abbas et al. (1991). 

[013] Immune responses of the acquired immune system can be classified into two
broad classes: humoral (antibody-mediated) immune responses and cell-mediated
(i.e., cytotoxic T cell) immune responses. Both types of responses require activation
of CD4+ T helper cells. Depending on several factors, one of which is the cytokine
environment, T helper (Th) cells may differentiate into either Th1 cells that promote
cell-mediated responses or Th2 cells that promote humoral responses. Th1 cells,
which produce interferon (IFN)-gamma, interleukin (IL)-2 and tumor necrosis factor
(TNF)-beta, evoke cell-mediated immunity and phagocyte-dependent inflammation.
Th2 cells, which produce IL-4, IL-5, IL-6, IL-9, IL-10, and IL-13, evoke strong
antibody responses (including those of the IgE class) and eosinophil accumulation,
but inhibit several functions of phagocytic cells (phagocyte-independent
inflammation). The presence of Th1 or Th2 T cells can have a dramatic effect on the
outcome of infection. A Th1 response during the course of infection by the
intracellular bacterium Mycobacterium leprae (M. leprae) is protective, whereas a
Th2 response is much less so. Patients who make a Th2 response to M. leprae develop
full-blown lepromatous leprosy, which is eventually fatal. The (mis)regulation of Th1
and Th2 responses has been implicated in the pathogenesis of several diseases,
including several organ-specific autoimmune disorders such as Crohn's disease,
sarcoidosis, acute kidney allograft rejection, and some unexplained recurrent
abortions. For a review on Th1 and Th2 subsets, see Romagnani, Ann. Allergy Asthma
Immunol. 85:9-18 (2000).

[014] From the preceding example it is apparent that cytokines play key roles
in determining the class and effectiveness of the immune response. It is important to
note that cytokines have effects on cells of both the innate and acquired immune
systems and are produced by both immune and non-immune cell types.

[015] Other cytokines such as interferon-alpha (secreted by leukocytes) and 
interferon-beta (secreted by fibroblasts and many other cell types) are cytokines that 
function to target the immune system towards fighting viral infections. The binding of 
interferon-alpha and -beta to cells results in a cellular signalling cascade which 
ultimately results in the inhibition of viral replication in infected cells, the upregulation 
of MHC class I expression on cells, and the activation of Natural Killer (NK) cells.
Interferons are useful in the diagnosis, treatment and prevention of viral infections and 
cancers. 

Intracellular immunomodulators. 

[016] Immunomodulatory proteins are not only cytokines or cytokine receptors. 
They may also be located intracellularly. For example, they may be intracellular
components of a signaling pathway, or even intracellular receptors for certain 
signaling molecules such as steroids. One example of intracellular immunomodulatory 
proteins are the immunophilins such as cyclophilin and FK binding protein (FKBP). 
These immunophilins are peptidyl-prolyl cis-trans isomerases, though their enzymatic 
ability may be distinct from their role as immunomodulators. When these molecules 
are bound by the drugs Cyclosporin A and FK506, respectively, they in turn inhibit
the action of activated calcineurin. Calcineurin is a calcium activated serine/threonine
phosphatase which dephosphorylates the transcription factor Nuclear Factor of Activated T
cells (NF-AT). Upon dephosphorylation, NF-AT enters the nucleus and induces the 
transcription of several genes including IL-2. In sum, the immunophilin:drug 
complexes are able to inhibit clonal expansion of T cells by inhibiting IL-2 synthesis. 
In addition, FKBP when bound to another drug, rapamycin, can also inhibit the
signaling of IL-2 through the IL-2 receptor. FKBP:rapamycin complexes accomplish
the inhibition of IL-2 signaling not by binding to calcineurin, but by binding to and
inactivating the protein kinases associated with IL-2 signaling resulting in the same 
outcome, the inhibition of T cell clonal expansion. 

[017] Defects in any one or more of the components of the immune system can lead
to disease or susceptibility to infectious diseases. Two major classes of immune
system disorders are autoimmune diseases and immunodeficiencies. In
autoimmunity, the effector mechanisms of the immune system (e.g., antigen specific
antibodies and cellular cytotoxicity, e.g., of cytotoxic T cells, or natural killer cells)
are misdirected at self rather than foreign antigens, resulting in tissue destruction.
Diseases classified as or associated with immunodeficiencies are diseases in which the
immune system is unable to mount an effective immune response. A classic example
of an immunodeficiency is X-linked agammaglobulinemia in which an intracellular
signalling molecule expressed in B lymphocytes (Bruton's tyrosine kinase) is
defective. The loss of function of this kinase prevents B cell maturation, thus patients
with X-linked agammaglobulinemia do not have mature B cells and are unable to
make antibody, and as a result are susceptible to infection.

[018] The discovery of new human immune/hematopoietic polynucleotides, the
polypeptides encoded by them, and antibodies that immunospecifically bind these 
polypeptides, satisfies a need in the art by providing new compositions which are 
useful in the diagnosis, treatment, prevention and/or prognosis of disorders of the 
immune system, including, but not limited to, autoimmune disorders (e.g., systemic
lupus erythematosus, rheumatoid arthritis, idiopathic thrombocytopenic purpura and 
multiple sclerosis) and immunodeficiencies (e.g., X-linked agammaglobulinemia, 
severe combined immunodeficiency, Wiskott-Aldrich syndrome, and ataxia 
telangiectasia). Additionally, immune/hematopoietic molecules would be useful as 
agents to boost immune responsiveness to pathogens or to suppress immune reactions, 
for example as is necessary in conjunction with organ transplantation. 

Summary of the Invention

[019] The present invention relates to novel immune/hematopoietic related
polynucleotides, the polypeptides encoded by these polynucleotides herein collectively 
referred to as "immune/hematopoietic antigens," and antibodies that 
immunospecifically bind these polypeptides, and the use of such 
immune/hematopoietic polynucleotides, antigens, and antibodies for detecting, 
treating, preventing and/or prognosing disorders of the immune system, including, but 
not limited to, the presence of cancer and cancer metastases of cells of hematopoietic 
origin. More specifically, isolated immune/hematopoietic nucleic acid molecules are 
provided encoding novel immune/hematopoietic polypeptides. Novel 
immune/hematopoietic polypeptides and antibodies that bind to these polypeptides are 
provided. Also provided are vectors, host cells, and recombinant and synthetic 
methods for producing human immune/hematopoietic polynucleotides, polypeptides, 
and/or antibodies. The invention further relates to diagnostic and therapeutic methods 
useful for diagnosing, treating, preventing and/or prognosing disorders related to the 
immune system or hematopoietic cells or tissues, including cancers of cells of



WO 01/57182 



PCT/US01/01354 



hematopoietic origin, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for identifying agonists and antagonists 
of polynucleotides and polypeptides of the invention. The invention further relates to 
methods and/or compositions for inhibiting or promoting the production and/or 
function of the polypeptides of the invention. 

Detailed Description 

Tables 

[020] Table 1A summarizes some of the polynucleotides encompassed by the
invention (including cDNA clones related to the sequences (Clone ID NO:Z), contig
sequences (contig identifier (Contig ID:) and contig nucleotide sequence identifier
(SEQ ID NO:X))) and further summarizes certain characteristics of these
polynucleotides and the polypeptides encoded thereby. The first column provides a
unique clone identifier, "Clone ID NO:Z", for a cDNA plasmid related to each
immune/hematopoietic associated contig sequence disclosed in Table 1A. The second
column provides a unique contig identifier, "Contig ID:" for each of the contig
sequences disclosed in Table 1A. The third column provides the sequence identifier,
"SEQ ID NO:X", for each of the contig polynucleotide sequences disclosed in Table
1A. The fourth column, "ORF (From-To)", provides the location (i.e., nucleotide
position numbers) within the polynucleotide sequence of SEQ ID NO:X that delineate
the preferred open reading frame (ORF) shown in the sequence listing and referenced
in Table 1A as SEQ ID NO:Y (column 5). Column 6 lists residues comprising
predicted epitopes contained in the polypeptides encoded by each of the preferred
ORFs (SEQ ID NO:Y). Identification of potential immunogenic regions was
performed according to the method of Jameson and Wolf (CABIOS, 4:181-186
(1988)); specifically, the Genetics Computer Group (GCG) implementation of this
algorithm, embodied in the program PEPTIDESTRUCTURE (Wisconsin Package
v10.0, Genetics Computer Group (GCG), Madison, Wis.). This method returns a
measure of the probability that a given residue is found on the surface of the protein.
Regions where the antigenic index score is greater than 0.9 over at least 6 amino acids
are indicated in Table 1A as "Predicted Epitopes." In particular embodiments,
immune/hematopoietic associated polypeptides of the invention comprise, or
alternatively consist of, one, two, three, four, five or more of the predicted epitopes
described in Table 1A. It will be appreciated that depending on the analytical criteria
used to predict antigenic determinants, the exact address of the determinant may vary 
slightly. Column 7, "Tissue Distribution" shows the expression profile of tissue, cells, 
and/or cell line libraries which express the polynucleotides of the invention. The first 
number in column 7 (preceding the colon), represents the tissue/cell source identifier 
code corresponding to the code and description provided in Table 4. Expression of 
these polynucleotides was not observed in the other tissues and/or cell libraries tested. 
For those identifier codes in which the first two letters are not "AR", the second 
number in column 7 (following the colon) represents the number of times a sequence 
corresponding to the reference polynucleotide sequence (e.g., SEQ ID NO:X) was 
identified in the tissue/cell source. Those tissue/cell source identifier codes in which 
the first two letters are "AR" designate information generated using DNA array 
technology. Utilizing this technology, cDNAs were amplified by PCR and then 
transferred, in duplicate, onto the array. Gene expression was assayed through 
hybridization of first strand cDNA probes to the DNA array. cDNA probes were 
generated from total RNA extracted from a variety of different tissues and cell lines. 
Probe synthesis was performed in the presence of 33P dCTP, using oligo(dT) to prime
reverse transcription. After hybridization, high stringency washing conditions were 
employed to remove non-specific hybrids from the array. The remaining signal, 
emanating from each gene target, was measured using a Phosphorimager. Gene 
expression was reported as Phosphor Stimulating Luminescence (PSL) which reflects 
the level of phosphor signal generated from the probe hybridized to each of the gene 
targets represented on the array. A local background signal subtraction was performed 
before the total signal generated from each array was used to normalize gene 
expression between the different hybridizations. The value presented after "[array 
code]:" represents the mean of the duplicate values, following background subtraction 
and probe normalization. One of skill in the art could routinely use this information to 
identify normal and/or diseased tissue(s) which show a predominant expression pattern
of the corresponding polynucleotide of the invention or to identify polynucleotides 
which show predominant and/or specific tissue and/or cell expression. Column 8, 
"Cytologic Band," provides the chromosomal location of polynucleotides
corresponding to SEQ ID NO:X. Chromosomal location was determined by finding 
exact matches to EST and cDNA sequences contained in the NCBI (National Center 
for Biotechnology Information) UniGene database. Given a presumptive chromosomal 
location, disease locus association was determined by comparison with the Morbid 
Map, derived from Online Mendelian Inheritance in Man (Online Mendelian 
Inheritance in Man, OMIM™. McKusick-Nathans Institute for Genetic Medicine, 
Johns Hopkins University (Baltimore, MD) and National Center for Biotechnology 
Information, National Library of Medicine (Bethesda, MD) 2000. World Wide Web 
URL: http://www.ncbi.nlm.nih.gov/omim/). If the putative chromosomal location of
the Query overlapped with the chromosomal location of a Morbid Map entry, an
OMIM identification number is provided in Table 1A, column 9 labeled "OMIM
Disease Reference(s)". A key to the OMIM reference identification numbers is
provided in Table 5. 
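
Two of the computations described in the preceding paragraph can be sketched in code. The Python below is a minimal illustration only, not the GCG PEPTIDESTRUCTURE program or the actual array analysis pipeline. It assumes (i) that a "Predicted Epitope" is a run of at least 6 consecutive residues each scoring above 0.9, and (ii) that normalization divides each background-subtracted spot by the total background-subtracted signal of its array, with negative values clamped to zero. The function names and input layouts are hypothetical.

```python
from statistics import mean


def predicted_epitopes(scores, threshold=0.9, min_len=6):
    """Return 1-based, inclusive (start, end) runs of residues whose
    per-residue antigenic index is strictly greater than `threshold`
    for at least `min_len` consecutive residues (assumed reading)."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos
        else:
            if start is not None and pos - start >= min_len:
                regions.append((start, pos - 1))
            start = None
    # Close a run that extends to the last residue.
    if start is not None and len(scores) - start + 1 >= min_len:
        regions.append((start, len(scores)))
    return regions


def normalize_array(raw, background):
    """raw and background map each gene target to its duplicate spot
    values. Returns, per gene, the mean of the duplicates after local
    background subtraction and division by the array's total signal
    (the choice of scaling is an assumption)."""
    corrected = {
        gene: [max(r - b, 0.0) for r, b in zip(raw[gene], background[gene])]
        for gene in raw
    }
    total = sum(sum(values) for values in corrected.values())
    if total == 0:
        raise ValueError("array has no signal above background")
    return {gene: mean(values) / total for gene, values in corrected.items()}


if __name__ == "__main__":
    idx = [0.5] + [0.95] * 6 + [0.2] + [0.95] * 5
    print(predicted_epitopes(idx))
    print(normalize_array({"g1": (110, 90), "g2": (60, 40)},
                          {"g1": (10, 10), "g2": (10, 10)}))
```

Other readings of the 0.9 criterion (for example, a windowed average) or other scaling conventions would change the results; the sketch only makes one such reading concrete.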

[021] Table 1B summarizes additional polynucleotides encompassed by the
invention (including cDNA clones related to the sequences (Clone ID NO:Z), contig
sequences (contig identifier (Contig ID:) and contig nucleotide sequence identifiers (SEQ
ID NO:X)), and genomic sequences (SEQ ID NO:B)). The first column provides a
unique clone identifier, "Clone ID NO:Z", for a cDNA clone related to each contig 
sequence. The second column provides the sequence identifier, "SEQ ID NO:X", for 
each contig sequence. The third column provides a unique contig identifier, "Contig 
ID:" for each contig sequence. The fourth column provides a BAC identifier, "BAC
ID NO:A", for the BAC clone referenced in the corresponding row of the table. The
fifth column provides the nucleotide sequence identifier, "SEQ ID NO:B" for a 
fragment of the BAC clone identified in column four of the corresponding row of the 
table. The sixth column, "Exon From-To", provides the location (i.e., nucleotide 
position numbers) within the polynucleotide sequence of SEQ ID NO:B which 
delineate certain polynucleotides of the invention that are also exemplary members of 
polynucleotide sequences that encode polypeptides of the invention (e.g., polypeptides
containing amino acid sequences encoded by the polynucleotide sequences delineated 
in column six, and fragments and variants thereof). 
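
The "From-To" columns of the tables (ORF, Exon, and NT positions) identify a region by nucleotide position numbers. A minimal Python sketch of how such a region might be pulled out of a sequence is given below. It assumes that positions are 1-based and inclusive at both ends, which is a common convention for sequence listings but is not stated explicitly here; the function name and the example sequence are hypothetical.

```python
def extract_region(sequence, start, end):
    """Return the subsequence spanning positions start..end, where both
    positions are 1-based and inclusive (assumed convention)."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(
            f"positions {start}-{end} fall outside 1-{len(sequence)}"
        )
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


if __name__ == "__main__":
    # Hypothetical 10-nucleotide sequence; positions 3-6 are G, T, A, C.
    print(extract_region("ACGTACGTAC", 3, 6))
```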

[022] Table 2 summarizes homology and features of some of the polypeptides of the 
invention. The first column provides a unique clone identifier, "Clone ID NO:Z", 
corresponding to a cDNA disclosed in Table 1A. The second column provides the
unique contig identifier, "Contig ID:" corresponding to contigs in Table 1A and
allowing for correlation with the information in Table 1A. The third column provides
the sequence identifier, "SEQ ID NO:X", for the contig polynucleotide sequences. 
The fourth column provides the analysis method by which the homology/identity 
disclosed in the row was determined. Comparisons were made between polypeptides 
encoded by the polynucleotides of the invention and either a non-redundant protein 
database (herein referred to as "NR"), or a database of protein families (herein referred 
to as "PFAM") as further described below. The fifth column provides a description of 
PFAM/NR hits having significant matches to a polypeptide of the invention. Column 
six provides the accession number of the PFAM/NR hit disclosed in the fifth column. 
Column seven, "Score/Percent Identity", provides a quality score or the percent 
identity, of the hit disclosed in column five. Columns 8 and 9, "NT From" and "NT
To" respectively, delineate the polynucleotides in "SEQ ID NO:X" that encode a 
polypeptide having a significant match to the PFAM/NR database as disclosed in the 
fifth column. In specific embodiments, polypeptides of the invention comprise, or 
alternatively consist of, an amino acid sequence encoded by the polynucleotides in 
SEQ ID NO:X as delineated in columns 8 and 9, or fragments or variants thereof.

[023] Table 3 provides polynucleotide sequences that may be disclaimed according
to certain embodiments of the invention. The first column provides a unique clone
identifier, "Clone ID NO:Z", for a cDNA clone related to immune/hematopoietic
associated contig sequences disclosed in Table 1A. The second column provides the
sequence identifier, "SEQ ID NO:X", for contig polynucleotide sequences disclosed
in Table 1A. The third column provides the unique contig identifier, "Contig ID", for
contigs disclosed in Table 1A. The fourth column provides a unique integer 'a' where
'a' is any integer between 1 and the final nucleotide minus 15 of SEQ ID NO:X,
represented as "Range of a", and the fifth column provides a unique integer 'b' where
'b' is any integer between 15 and the final nucleotide of SEQ ID NO:X, represented as
"Range of b", where both a and b correspond to the positions of nucleotide residues
shown in SEQ ID NO:X, and where b is greater than or equal to a + 14. For each of
the polynucleotides shown as SEQ ID NO:X, the uniquely defined integers can be 
substituted into the general formula of a-b, and used to describe polynucleotides which 
may be preferably excluded from the invention. In certain embodiments, preferably
excluded from the polynucleotides of the invention (including polynucleotide 
fragments and variants as described herein and diagnostic and/or therapeutic uses 
based on these polynucleotides) are at least one, two, three, four, five, ten, or more of 
the polynucleotide sequence(s) having the accession number(s) disclosed in the sixth 
column of this Table (including for example, published sequence in connection with a 
particular BAC clone). In further embodiments, preferably excluded from the 
invention are the specific polynucleotide sequence(s) contained in the clones 
corresponding to at least one, two, three, four, five, ten, or more of the available 
material having the accession numbers identified in the sixth column of this Table 
(including for example, the actual sequence contained in an identified BAC clone). 
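
The constraints on a and b described for Table 3 can be sketched as a small check. This is only an illustrative sketch: the function names and the 40-nucleotide example sequence are assumptions introduced here, and the ranges are read as inclusive.

```python
def is_valid_range(a: int, b: int, seq_len: int) -> bool:
    """Return True if (a, b) satisfies the stated constraints for a sequence of seq_len nucleotides."""
    return (
        1 <= a <= seq_len - 15   # a lies between 1 and the final nucleotide minus 15
        and 15 <= b <= seq_len   # b lies between 15 and the final nucleotide
        and b >= a + 14          # b is greater than or equal to a + 14
    )


def fragment(seq: str, a: int, b: int) -> str:
    """Return nucleotides a through b of seq, using 1-based inclusive positions."""
    if not is_valid_range(a, b, len(seq)):
        raise ValueError(f"a={a}, b={b} is outside the permitted ranges")
    return seq[a - 1:b]


example = "ACGT" * 10            # hypothetical 40-nucleotide sequence
print(fragment(example, 1, 15))  # shortest permitted fragment starting at position 1
```

Under this reading, the condition b >= a + 14 means that every fragment described by the formula a-b is at least 15 nucleotides long.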

[024] Table 4 provides a key to the tissue/cell source identifier code disclosed in 
Table 1A, column 7. Column 1 provides the key to the tissue/cell source identifier 
code disclosed in Table 1A, Column 7. Columns 2-5 provide a description of the tissue 
or cell source. Codes corresponding to diseased tissues are indicated in column 6 with 
the word "disease". The use of the word "disease" in column 6 is non-limiting. The 
tissue or cell source may be specific (e.g., a neoplasm), or may be disease-associated 
(e.g., a tissue sample from a normal portion of a diseased organ). Furthermore, tissues 
and/or cells lacking the "disease" designation may still be derived from sources 
directly or indirectly involved in a disease state or disorder, and therefore may have a 
further utility in that disease state or disorder. In numerous cases where the tissue/cell 
source is a library, column 7 identifies the vector used to generate the library. 

[025] Table 5 provides a key to the OMIM™ reference identification numbers 
disclosed in Table 1A, column 9. OMIM reference identification numbers (Column 1) 
were derived from Online Mendelian Inheritance in Man (Online Mendelian 
Inheritance in Man, OMIM™. McKusick-Nathans Institute for Genetic Medicine, 
Johns Hopkins University (Baltimore, MD) and National Center for Biotechnology 
Information, National Library of Medicine, (Bethesda, MD) 2000. World Wide Web 
URL: http://www.ncbi.nlm.nih.gov/omim/). Column 2 provides diseases associated 
with the cytologic band disclosed in Table 1A, column 8, as determined from the 
Morbid Map database. 

[026] Table 6 summarizes ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. 

[027] Table 7 shows the cDNA libraries sequenced, tissue source description, vector 
information and ATCC designation numbers relating to these cDNA libraries. 

[028] Table 8 provides a physical characterization of clones encompassed by the 
invention. The first column provides the unique clone identifier, "Clone ID NO:Z", 
for certain cDNA clones of the invention, as described in Table 1A. The second 
column provides the size of the cDNA insert contained in the corresponding cDNA 
clone. 

Definitions 

[029] The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

[030] In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. The 
term "isolated" does not refer to genomic or cDNA libraries, whole cell total or 
mRNA preparations, genomic DNA preparations (including those separated by 
electrophoresis and transferred onto blots), sheared whole cell genomic DNA 
preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide sequences of the present invention. 

[031] As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence encoding SEQ ID NO:Y or a fragment or variant thereof, a nucleic acid 
sequence contained in SEQ ID NO:X (as described in column 3 of Table 1A) or the 
complement thereof, a cDNA sequence contained in Clone ID NO:Z (as described in 
column 1 of Table 1A and contained within a library deposited with the ATCC); a 
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B or a fragment or variant thereof; or 
a nucleotide coding sequence in SEQ ID NO:B as defined in column 6 of Table 1B or 
the complement thereof. For example, the polynucleotide can contain the nucleotide 
sequence of the full length cDNA sequence, including the 5' and 3' untranslated 
sequences, the coding region, as well as fragments, epitopes, domains, and variants of 
the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having an amino acid sequence encoded by a polynucleotide of the invention 
as broadly defined (obviously excluding poly-Phenylalanine or poly-Lysine peptide 
sequences which result from translation of a polyA tail of a sequence corresponding to 
a cDNA). 

[032] As used herein, an "immune/hematopoietic antigen" refers collectively to any 
polynucleotide disclosed herein (e.g., a nucleic acid sequence contained in SEQ ID 
NO:X or the complement thereof, or cDNA sequence contained in Clone ID NO:Z, or a 
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B, or a nucleotide coding sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B or the complement thereof and 
fragments or variants thereof as described herein) or any polypeptide disclosed herein 
(e.g., an amino acid sequence contained in SEQ ID NO:Y, an amino acid sequence 
encoded by SEQ ID NO:X, or the complement thereof, an amino acid sequence 
encoded by the cDNA sequence contained in Clone ID NO:Z, an amino acid sequence 
encoded by SEQ ID NO:B, or the complement thereof, and fragments or variants 
thereof as described herein). These immune/hematopoietic antigens have been 
determined to be predominantly expressed in hematopoietic tissues (e.g., bone 
marrow, fetal liver, and fetal spleen) or cells and tissues of the immune system (e.g., 
lymph nodes, spleen, B cells, T cells, monocytes, macrophages, dendritic cells, 
neutrophils, mast cells, basophils, and eosinophils) including normal or diseased 
tissues (as shown in Table 1A, column 7 and Table 4). 

[033] In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone 
containing all or most of the sequence for SEQ ID NO:X is deposited at Human 
Genome Sciences, Inc. (HGS) in a catalogued and archived library. As shown, for 
example, in column 1 of Table 1A, each clone is identified by a cDNA Clone ID 
(identifier generally referred to herein as Clone ID NO:Z). Each Clone ID is unique to 
an individual clone and the Clone ID is all the information needed to retrieve a given 
clone from the HGS library. Furthermore, certain clones disclosed in this application 
have been deposited with the ATCC on October 5, 2000, having the ATCC 
designation numbers PTA 2574 and PTA 2575; and on January 5, 2001, having the 
depositor reference numbers TS-1, TS-2, AC-1, and AC-2. In addition to the 
individual cDNA clone deposits, most of the cDNA libraries from which the clones 
were derived were deposited at the American Type Culture Collection (hereinafter 
"ATCC"). Table 7 provides a list of the deposited cDNA libraries. One can use the 
Clone ID NO:Z to determine the library source by reference to Tables 6 and 7. Table 
7 lists the deposited cDNA libraries by name and links each library to an ATCC 
Deposit. Library names contain four characters, for example, "HTWE." The name of 
a cDNA clone (Clone ID NO:Z) isolated from that library begins with the same four 
characters, for example "HTWEP07". As mentioned below, Table 1A correlates the 
Clone ID NO:Z names with SEQ ID NO:X. Thus, starting with an SEQ ID NO:X, one 
can use Tables 1A, 6 and 7 to determine the corresponding Clone ID NO:Z, which 
library it came from and which ATCC deposit the library is contained in. Furthermore, 
it is possible to retrieve a given cDNA clone from the source library by techniques 
known in the art and described elsewhere herein. The ATCC is located at 10801 
University Boulevard, Manassas, Virginia 20110-2209, USA. The ATCC deposits 
were made pursuant to the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for the purposes of patent procedure. 
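
The naming convention described above, in which a Clone ID begins with the four characters of its library name, can be sketched as follows. The function names are hypothetical and introduced only for illustration; mapping a library name to its ATCC deposit would additionally require the contents of Tables 6 and 7, which are not reproduced here.

```python
from collections import defaultdict


def library_name(clone_id: str) -> str:
    """Return the four-character library name that prefixes a Clone ID."""
    if len(clone_id) <= 4:
        raise ValueError(f"{clone_id!r} is too short to be a Clone ID")
    return clone_id[:4]


def group_by_library(clone_ids):
    """Group Clone IDs under the library each one was isolated from."""
    groups = defaultdict(list)
    for clone_id in clone_ids:
        groups[library_name(clone_id)].append(clone_id)
    return dict(groups)


print(library_name("HTWEP07"))  # the example given in the text
```
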
[034] In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous 
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 
7.5 kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, 
polynucleotides of the invention comprise a portion of the coding sequences, as 
disclosed herein, but do not comprise all or a portion of any intron. In another 
embodiment, the polynucleotides comprising coding sequences do not contain coding 
sequences of a genomic flanking gene (i.e., 5' or 3' to the gene of interest in the 
genome). In other embodiments, the polynucleotides of the invention do not contain 
the coding sequence of more than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 
1 genomic flanking gene(s). 

[035] A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences 
contained in SEQ ID NO:X, or the complement thereof (e.g., the complement of any 
one, two, three, four, or more of the polynucleotide fragments described herein), the 
polynucleotide sequence delineated in columns 8 and 9 of Table 2 or the complement 
thereof, and/or cDNA sequences contained in Clone ID NO:Z (e.g., the complement of 
any one, two, three, four, or more of the polynucleotide fragments, or the cDNA clone 
within the pool of cDNA clones deposited with the ATCC, described herein) and/or 
the polynucleotide sequence delineated in column 6 of Table 1B or the complement 
thereof. "Stringent hybridization conditions" refers to an overnight incubation at 
42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0.1x SSC at about 65°C. 

[036] Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency), salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 
0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 µg/ml 
salmon sperm blocking DNA; followed by washes at 50°C with 1X SSPE, 0.1% 
SDS. In addition, to achieve even lower stringency, washes performed following 
stringent hybridization can be done at higher salt concentrations (e.g., 5X SSC). 
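
As a worked illustration of the salt concentrations implied by the buffer strengths above, the following sketch scales the stock compositions stated in the text (5x SSC and 20X SSPE) linearly to the working strengths. The helper name is hypothetical, and linear scaling of each component is the only assumption made.

```python
# Stock compositions exactly as stated in the text, in mM.
SSC_5X = {"NaCl": 750.0, "trisodium citrate": 75.0}
SSPE_20X = {"NaCl": 3000.0, "NaH2PO4": 200.0, "EDTA": 20.0}


def dilute(stock: dict, stock_strength: float, target_strength: float) -> dict:
    """Scale every component of a stock buffer linearly to the target strength."""
    return {name: conc * target_strength / stock_strength for name, conc in stock.items()}


wash_ssc = dilute(SSC_5X, 5, 0.1)    # 0.1x SSC stringent wash
hyb_sspe = dilute(SSPE_20X, 20, 6)   # 6X SSPE lower-stringency hybridization
wash_sspe = dilute(SSPE_20X, 20, 1)  # 1X SSPE lower-stringency wash

for label, buffer in (("0.1x SSC", wash_ssc), ("6X SSPE", hyb_sspe), ("1X SSPE", wash_sspe)):
    print(label, buffer)
```

On this arithmetic the stringent wash contains about 15 mM NaCl, whereas the lower-stringency 1X SSPE wash contains about 150 mM NaCl, which is consistent with the statement that higher salt lowers stringency.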

[037] Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

[038] Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone generated using oligo dT as a primer). 

[039] The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is a mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA that 
may be single-stranded or, more typically, double-stranded or a mixture of single- and 
double-stranded regions. In addition, the polynucleotide can be composed of triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. A polynucleotide 
may also contain one or more modified bases or DNA or RNA backbones modified for 
stability or for other reasons. "Modified" bases include, for example, tritylated bases 
and unusual bases such as inosine. A variety of modifications can be made to DNA 
and RNA; thus, "polynucleotide" embraces chemically, enzymatically, or 
metabolically modified forms. 
[040] The polypeptide of the present invention can be composed of amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres, and may contain amino acids other than the 20 gene-encoded amino acids. 
The polypeptides may be modified by either natural processes, such as 
posttranslational processing, or by chemical modification techniques which are well 
known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid 
side-chains and the amino or carboxyl termini. It will be appreciated that the same 
type of modification may be present in the same or varying degrees at several sites in a 
given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched, for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, 
and branched cyclic polypeptides may result from posttranslation natural processes or 
may be made by synthetic methods. Modifications include acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphatidylinositol, 
cross-linking, cyclization, disulfide bond formation, demethylation, formation of 
covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, 
methylation, myristoylation, oxidation, pegylation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth. 
Enzymol. 182:626-646 (1990); Rattan et al., Ann. N.Y. Acad. Sci. 663:48-62 (1992).) 

[041] "SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in 
Tables 1A or 2, while "SEQ ID NO:Y" refers to a polypeptide sequence described in 
column 5 of Table 1A. SEQ ID NO:X is identified by an integer specified in column 3 
of Table 1A. The polypeptide sequence SEQ ID NO:Y is a translated open reading 
frame (ORF) encoded by polynucleotide SEQ ID NO:X. "Clone ID NO:Z" refers to a 
cDNA clone described in column 1 of Table 1A. 

[042] "A polypeptide having biological activity" refers to a polypeptide exhibiting 
activity similar to, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency does 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 
more than about 25-fold less and, preferably, not more than about tenfold less activity, 
and most preferably, not more than about three-fold less activity relative to the 
polypeptide of the present invention). 
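
The fold-difference tiers in the preceding definition can be sketched as a simple comparison. This is an illustrative reading only: the text says "about" 25-fold, tenfold, and three-fold, so treating these as sharp cutoffs is an assumption, and the function name and tier labels are hypothetical.

```python
def activity_tier(candidate_activity: float, reference_activity: float) -> str:
    """Classify a candidate by how many fold less active it is than the reference polypeptide."""
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    fold_less = reference_activity / candidate_activity  # values <= 1 mean equal or greater activity
    if fold_less <= 3:
        return "most preferred"   # greater activity, or not more than about three-fold less
    if fold_less <= 10:
        return "preferred"        # not more than about tenfold less
    if fold_less <= 25:
        return "within range"     # not more than about 25-fold less
    return "outside range"


print(activity_tier(20.0, 100.0))  # a candidate with five-fold less activity
```
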
[043] Table 1A summarizes some of the immune/hematopoietic associated 
polynucleotides encompassed by the invention (including contig sequences (SEQ ID 
NO:X) and clones (Clone ID NO:Z)) and further summarizes certain characteristics of 
these polynucleotides and the polypeptides encoded thereby. 

Polynucleotides and Polypeptides 

TABLE 1A 

2, L0766: 2, S0114: 1, 

614930, 933143, 690404, 703843, 966608, 786707, 576434, 578887, 573103, 921417, 506594 

HBJMK34, HBJMK94, HBJML28, HBJML69, HBJMM72, HBJMN75, HBJMQ86, HBJMR15, HBJMR60, 
HBJMT52, HBJMV72, HBJMW20, HBJMX04, HBJMX29 

L0766: 2, L0747: 2, L0779: 
H0583: 1, S0116: 1, L0800: 
S0031: 1 and H0444: 1. 
H0179: 1 and H0087: 1. 

1-186 

825406, 518245, 957396, 971651, 860967, 537336, 669916 

HNFD084, HNFDP09, HNFDR35, HNFDR83, HNFDV41, HNFED22, HNFED43, HNFEF43 

[044] The first column in Table 1A provides a unique "Clone ID NO:Z" for a cDNA 
clone related to each contig sequence disclosed in Table 1A. This clone ID references 
the cDNA clone which contains at least the 5' most sequence of the assembled contig, 
and at least a portion of SEQ ID NO:X was determined by directly sequencing the 
referenced clone. The reference clone may have more sequence than described in the 
sequence listing or the clone may have less. In the vast majority of cases, however, the 
clone is believed to encode a full-length polypeptide. In the case where a clone is not 
full-length, a full-length cDNA can be obtained by methods known in the art and/or as 
described elsewhere herein. 
[045] The second column in Table 1A provides a unique "Contig ID" identification 
for each contig sequence. The third column provides the "SEQ ID NO:X" identifier for 
each of the immune/hematopoietic associated contig polynucleotide sequences disclosed 
in Table 1A. The fourth column, "ORF (From-To)", provides the location (i.e., 
nucleotide position numbers) within the polynucleotide sequence "SEQ ID NO:X" that 
delineate the preferred open reading frame (ORF) shown in the sequence listing and 
referenced in Table 1A, column 5, as SEQ ID NO:Y. Where the nucleotide position 
number "To" is lower than the nucleotide position number "From", the preferred ORF is 
the reverse complement of the referenced polynucleotide sequence. 
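
The "ORF (From-To)" convention just described can be sketched as follows. The function names and the short example sequence are hypothetical, and the sketch assumes that both positions are 1-based, inclusive, and numbered on the sequence as listed.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def preferred_orf(seq: str, orf_from: int, orf_to: int) -> str:
    """Extract the ORF delineated by From-To; if To < From, read the reverse complement."""
    if orf_from <= orf_to:
        return seq[orf_from - 1:orf_to]
    # To is lower than From: the ORF lies on the opposite strand.
    return reverse_complement(seq[orf_to - 1:orf_from])


example = "AAATGCCCTAGGG"            # hypothetical 13-nucleotide sequence
print(preferred_orf(example, 3, 11))  # forward-strand ORF
print(preferred_orf(example, 11, 3))  # same span read as the reverse complement
```
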
[046] The fifth column in Table 1A provides the corresponding SEQ ID NO:Y for 
the polypeptide sequence encoded by the preferred ORF delineated in column 4. In one 
embodiment, the invention provides an amino acid sequence comprising, or 
alternatively consisting of, a polypeptide encoded by the portion of SEQ ID NO:X 
delineated by "ORF (From-To)". Also provided are polynucleotides encoding such 
amino acid sequences and the complementary strand thereto. 
[047] Column 6 in Table 1A lists residues comprising epitopes contained in the 
polypeptides encoded by the preferred ORF (SEQ ID NO:Y), as predicted using the 
algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. The Jameson- 
Wolf antigenic analysis was performed using the computer program PROTEAN 
(Version 3.11 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street 
Madison, WI). In preferred embodiments, polypeptides of the invention comprise, or 
alternatively consist of, at least one, two, three, four, five or more of the predicted 
epitopes as described in Table 1A. It will be appreciated that depending on the 
analytical criteria used to predict antigenic determinants, the exact address of the 
determinant may vary slightly. 
[048] Column 7 in Table 1A provides an expression profile and library code: count 
for each of the contig sequences (SEQ ID NO:X) disclosed in Table 1A, which can 
routinely be combined with the information provided in Table 4 and used to determine 
the normal or diseased tissues, cells, and/or cell line libraries which predominantly 
express the polynucleotides of the invention. The first number in column 7 (preceding 
the colon), represents the tissue/cell source identifier code corresponding to the code and 
description provided in Table 4. For those identifier codes in which the first two letters 
are not "AR", the second number in column 7 (following the colon) represents the 
number of times a sequence corresponding to the reference polynucleotide sequence was 
identified in the tissue/cell source. Those tissue/cell source identifier codes in which the 
first two letters are "AR" designate information generated using DNA array technology. 
Utilizing this technology, cDNAs were amplified by PCR and then transferred, in 
duplicate, onto the array. Gene expression was assayed through hybridization of first 
strand cDNA probes to the DNA array. cDNA probes were generated from total RNA 
extracted from a variety of different tissues and cell lines. Probe synthesis was 
performed in the presence of 33P dCTP, using oligo(dT) to prime reverse transcription. 
After hybridization, high stringency washing conditions were employed to remove non- 
specific hybrids from the array. The remaining signal, emanating from each gene target, 
was measured using a Phosphorimager. Gene expression was reported as Phosphor 
Stimulating Luminescence (PSL) which reflects the level of phosphor signal generated 
from the probe hybridized to each of the gene targets represented on the array. A local 
background signal subtraction was performed before the total signal generated from 
each array was used to normalize gene expression between the different hybridizations. 
The value presented after "[array code]:" represents the mean of the duplicate values, 
following background subtraction and probe normalization. One of skill in the art could 
routinely use this information to identify normal and/or diseased tissue(s) which show a 
predominant expression pattern of the corresponding polynucleotide of the invention or 
to identify polynucleotides which show predominant and/or specific tissue and/or cell 
expression. The sequences disclosed herein have been determined to be predominantly 
expressed in immune/hematopoietic tissues, including normal and diseased 
immune/hematopoietic tissues (See Table 1A, column 7 and Table 4). 
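
The "library code: count" entries of column 7 can be read with a short parser such as the sketch below. The regular expression, the type name, and the example array code "AR061" are assumptions made for illustration. Only the rule that codes beginning with "AR" carry array-derived values rather than occurrence counts comes from the text.

```python
import re
from typing import List, NamedTuple


class ProfileEntry(NamedTuple):
    code: str       # tissue/cell source identifier code (key into Table 4)
    value: float    # occurrence count, or mean normalized signal for "AR" codes
    is_array: bool  # True when the code begins with "AR"


# One or more capital letters followed by digits, a colon, then a number.
_ENTRY = re.compile(r"([A-Z]+\d+)\s*:\s*(\d+(?:\.\d+)?)")


def parse_expression_profile(text: str) -> List[ProfileEntry]:
    """Split a column 7 string into (code, value, is_array) entries."""
    return [
        ProfileEntry(code, float(value), code.startswith("AR"))
        for code, value in _ENTRY.findall(text)
    ]


for entry in parse_expression_profile("L0766: 2, L0747: 2, S0031: 1 and H0444: 1."):
    print(entry)
```
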

[049] Column 8 in Table 1A provides a chromosomal map location for certain 
polynucleotides of the invention. Chromosomal location was determined by finding 
exact matches to EST and cDNA sequences contained in the NCBI (National Center for 
Biotechnology Information) UniGene database. Each sequence in the UniGene database 
is assigned to a "cluster"; all of the ESTs, cDNAs, and STSs in a cluster are believed to 
be derived from a single gene. Chromosomal mapping data is often available for one or 
more sequence(s) in a UniGene cluster; this data (if consistent) is then applied to the 
cluster as a whole. Thus, it is possible to infer the chromosomal location of a new 
polynucleotide sequence by determining its identity with a mapped UniGene cluster. 

[050] A modified version of the computer program BLASTN (Altschul et al., J. Mol. 
Biol. 215:403-410 (1990), and Gish et al., Nat. Genet. 3:266-272 (1993)) was used to 
search the UniGene database for EST or cDNA sequences that contain exact or near- 
exact matches to a polynucleotide sequence of the invention (the 'Query'). A sequence 
from the UniGene database (the 'Subject') was said to be an exact match if it contained 
a segment of 50 nucleotides in length such that 48 of those nucleotides were in the same 
order as found in the Query sequence. If all of the matches that met these criteria were in 
the same UniGene cluster, and mapping data was available for this cluster, it is indicated 
in Table 1A under the heading "Cytologic Band". Where a cluster had been further 
localized to a distinct cytologic band, that band is disclosed; where no banding 
information was available, but the gene had been localized to a single chromosome, the 
chromosome is disclosed. 
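
The 48-of-50 match criterion can be illustrated with the brute-force sketch below. It is not the modified BLASTN program referred to above: it considers only ungapped alignments, compares every 50-nucleotide window of the Subject with every 50-nucleotide window of the Query, and the function and parameter names are hypothetical.

```python
def has_near_exact_match(query: str, subject: str, window: int = 50, min_matches: int = 48) -> bool:
    """Return True if some window of subject matches a window of query at >= min_matches positions."""
    for i in range(len(query) - window + 1):
        query_window = query[i:i + window]
        for j in range(len(subject) - window + 1):
            subject_window = subject[j:j + window]
            # Count positions at which the two ungapped windows agree.
            matches = sum(q == s for q, s in zip(query_window, subject_window))
            if matches >= min_matches:
                return True
    return False


query = "ACGT" * 20                   # hypothetical 80-nucleotide Query
subject = list(query[10:60])          # 50-nucleotide segment taken from the Query
subject[5] = subject[20] = "N"        # introduce two mismatches
print(has_near_exact_match(query, "".join(subject)))
```

A production search would use an indexed, gapped aligner; the nested loops here are meant only to make the window criterion concrete.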

[051] Once a presumptive chromosomal location was determined for a 
polynucleotide of the invention, an associated disease locus was identified by 
comparison with a database of diseases which have been experimentally associated with 
genetic loci. The database used was the Morbid Map, derived from OMIM™ (supra). If 
the putative chromosomal location of a polynucleotide of the invention (Query 
sequence) was associated with a disease in the Morbid Map database, an OMIM 
reference identification number was noted in column 9, Table 1A, labeled "OMIM 
Disease Reference(s)". Table 5 is a key to the OMIM reference identification numbers 
(column 1), and provides a description of the associated disease in Column 2. 
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TABLE IB 



Clone ID 
NO:Z 


SEQ ID NO:X 


CONTIG ID: 


BAC ID: A 


SEQ ID NO:B 


EXON 
From-To 


HAMHB21 


11 


961376 


AL035530 


19515 


1-137 
465-2035 
2761-2939 
3499-3663 
3841-4296 

5707-5905 
5946-6330 
6790-6899 
6957-7401 
7628-7818 
7889-10548 


HAMHB21 


11 


961376 


AL035530 


19516 


1-577 


HASAX16 


12 


573692 


AL358796 


19517 


1-1065 


HASAX16 


12 


573692 


AL356513 


19518 


1-1281 


HASAX16 


12 


573692 


AL358796 


19519 


1-964 


HAS AX 16 


12 


573692 


AL356513 


19520 


1-540 


HASAX16 


12 


573692 


AL356513 


19521 


1-967 


HASAY74 


13 


526312 


AC006479 


19522 


1-1081 


HASAY74 


13 


526312 


AC006479 


19523 


1-1052 


HASAY74 


13 


526312 


AC006479 


19524 


1-117 


HASAY89 


14 


958768 


AC012580 


19525 


1-874 


HASAY89 


14 


958768 


AL133502 


19526 


1-874 


HASAY89 


14 


958768 


AC012580 


19527 


1-87 


HASAY89 


14 


958768 


AC012580 


19528 


1-1034 
1426-2177 


HASAY89 


14 


958768 


AL1 33502 


19529 


1-1034 
1426-2177 


HASAY94 


15 


521835 


AC022702 


19530 


1-220 
2365-2620 
2992-3310 

5266-5325 
58 14-6468 
6801-6965 
7327-7763 
7979-8172 


HASAY94 


15 


521835 


AC025796 


19531 


1-272 


HASAY94 


15 


521835 


AC024998 


19532 


1-272 


HASAY94 


15 


521835 


AL365438 


19533 


1-272 


HASAY94 


15 


521835 


AL390122 


19534 


1-272 


HASAY94 


15 


521835 


AL133216 


19535 


1-306 
1488-1890 
2450-2722 
3082-3400 
3526-4081 
4304-5314 


HASAY94 


15 


521835 


AF198096 


19536 


1-308 
1490-1896 
2457-2728 
2816-3407 
3532-4087 
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4313-5008 
3584-6209 




ID 


Ml QIC 






1-592 




1 < 
I j 


COIOOC 


AlAJZ/ /UZ 




I-olo 


HASAY94 


15 


521835 


AC025796 


19539 


1-592 


HASAY94 


15 


521835 


AL365438 


19540 


1-591 


HASAY94 


15 


521835 


AL390122 


19541 


1-592 


HASAY94 


15 


521835 


AL133216 


19542 


1-750 


HASAY94 
• 


15 


521835 


AL133216 


19543 


1-138 
734-1117 
1716-2301 
2496-3003 
3142-3588 
5476-6185 
6679-7322 
7428-7952 
8163-8628 
9071-9154 
9380-9538 
10422-10640 
10673-10765 
10780-10828 
11018-11735 
12735-13548 
13651-14193 
14663-14949 

4\ A* A*\ 4 AW A Ap At 

15056-15454 
17688-18060 
19315-19659 


XT A O A \?f%A 


15 


521835 


AF198096 


1 f\j? A A 

19544 


1-2408 


TT A O A \7C\A 

HASAY94 


15 


521835 


AF198096 


19545 


1-373 




17 


t\CA DTI 

954871 


AL023808 


19546 


1-606 


HBCAL39 


17 


964871 


AL354975 


19547 


1-609 


HBCAL39 


17 


964871 


AL355583 


19548 


1-609 


HBCAL39 


17 


964871 


AL1 17337 


19549 


1-607 


HBCAL39 


17 


964871 


AL161931 


19550 


1-607 


HBCAL39 


17 


964871 


AL022345 


19551 


1-83 
1585-1635 
1808-2378 
5485-5902 
6116-6277 
8376-8779 
8822-8972 
9168-9821 
11260-11538 
13579-13698 
17448-17520 
17571-17848 
18839-19282 
19624-20151 
20479-21084' 
21175-21448 
32819-33061 
34887-35180 
35454-36250 
37565-38021 
38576-38855 
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40356-40851 
40884-41362 
43851-44422 
45111-46319 
46743-49458 
50723-51179 

^71 7A_^71GO 

5532R-55457 
55462-56374 

•/•/TV/A JW / ~ 

58253-58708 


HBCAL39 


17 


964871 


AF205588 


19552 


1-607 


HBCAL39 


17 


964871 


AL354975 


19553 


1-274 


HBCAL39 


17 


964871 


AL023808 


19554 


1-528 


HBCAL39 


17 


964871 


AL354975 


19555 


1-528 


HBCAL39 


17 


964871 


AL355583 


19556 


1-274 


HBCAL39 


17 


964871 


AL355583 


19557 


1-528 


HBCAL39 


17 


964871 


AL1 17337 


19558 


1-273 


HBCAL39 


17 


964871 


AL1 17337 


19559 


1-526 


HBCAL39 


17 


964871 


AL1 61931 


19560 


1-273 


HBCAL39 


17 


964871 


AL161931 


19561 


1-526 


HBCAL39 


17 


964871 


AL022345 


19562 


1-465 


HBCAL39 


17 


964871 


AF205588 


19563 


1-526 


HBCAL39 


17 


964871 


AF205588 


19564 


1-273 


HBCAR79 


19 


573989 


AC021215 


19565 


1-432 


HBCAT17 


21 


503573 


AC011448 


19566 


1-874 


HBCAT17 


21 


503573 


ACO 11448 


19567 


1-749 


HBCAT63 


22 


573993 


AL133243 


19568 


1-236 


HBCAT63 


22 


573993 


AL1 33243 


19569 


1-364 


HBCAT63 


22 


573993 


AL133243 


19570 


1-141 


HBCBX12 


24 


861018 


AC068475 


19571 


1-70 
163-330 
651-994 
1105-1272 
1372-1586 
2253-2908 
2995-3524 
3711-4406 
4418-4480 
■ 4581-5218 
5621-5829 
6007-6286 


HBCBX12 


24 


861018 


AC005954 


19572 


1-91 
893-1009 
1323-1695 
1856-2422 
3548-3650 
3665-4010 
4965-5170 
5288-5397 
6874-7000 
7283-7430 
7520-7600 
7693-7860 
8181-8524 
8634-8807 
8902-9116 
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9783-10438 
10525-11054 
11241-11936 
11948-12010 
12111-12748 
13154-13362 
13540-13833 
14748-14851 
14928-15142 
15543-15616 
17091-17240 
17351-18020 
18331-18662 
19524-19871 
19999-20209 
20570-20670 
20861-21075 
22489-22727 
22961-23073 
25307-25360 
29573-29961 
31051-31168 


HBCBX12 


24 


861018 


AC005954 


19573 


1-131 


HBDAC79 


26 


935414 


AP001785 


19574 


1-99 
368-765 
963-1141 
1966-2078 
2175-2597 
3058-3200 
3479-3686 
4113-4631 
4727-5095 
5187-5344 
5973-6073 
6176-7972 
8126-9308 


HBDAC79 


26 


935414 


AP001785 


19575 


1-224 


HBDAC79 


26 


935414 


AP001785 


19576 


1-428 


HBDAD04 


27 


614849 


AC055822 


19577 


1-970 


HBDAD04 


27 


614849 


AC055822 


19578 


1-424 


HBDAD04 


27 


614849 


AC055822 


19579 


1-281 


HBDAF51 


28 


725481 


AC041031 


19580 


1-709 


HBDAF51 


28 


725481 


AC012314 


19581 


1-859 
928-1588 
2180-2459 
2554-2831 
2984-3375 
3783-4571 
4867-5280 
5713-7103 


HBDAF51 


28 


725481 


AC010492 


19582 


1-859 
928-1588
2180-2459 
2553-2830 
2983-3374 
3782-4570 
4866-5279
5719-7109


HBDAF51 


28 


725481 


AC012314 


19583 


1-498 


HBDAF51 


28 


725481 


AC010492


19584 


1-498


HBDAF61 


29 


864338 


AC003963 


19585 


1-1940 


HBDAF61 


29 


864338 


AC003963 


19586 


1-589 


HBDAF61 


29 


864338 


AC003963 


19587 


1-106 


HBJAB59 


30 


557972 


AC073349 


19588 


1-596 


HBJAB59 


30 


557972 


AC073349 


19589 


1-107 


HBJAB59 


30 


557972 


AC073349 


19590 


1-UvO 


HBJAC23 


31 


529753 


AC008013 

ilivvvuu 1 *J 


X J J J x
19£19 19770
XZOJO-XZ/ i\J 












xzy**x-x,suy / 












13610-13902
13943-14422
15994-16388
16841-17233
17379-18129


HBJAC23 


31 


529753 


AC008013 


19593 


1-294


HBJAC23


31 


529753 


AC008013 


19594 


1-289
692-1099
1395-1823
2203-2837
3577-4809
4987-5449
5855-6157
6332-6552
6672-6725
6773-6994
7254-7358
7391-7785
7880-8597
10291-10690
10700-11175
11393-11530
11644-11749


HBJAC23


31


529753 


AC006432 


19595 


1-290 


HBJAG72


32


722723 


AC022533 


19596 


1-479 


HBJAG72


32


722723 


AC068898 


19597 


1-479 


HBJAG72 


32 


722723 


AC022533 


19598 


1-140 


HBJAG72 


32 


722723 


AC068898 


19599 


1-140 


HBJAJ75 


34 


953840 


AL133163 


19600 


1-98
211-417
784-1004
1875-2071
2745-2913
3205-4214
4242-4552
4695-5574
6331-6850
7048-7631
7696-8408
8763-8873
8944-10131
10166-10740


HBJAJ75


34 


953840 


AL133163


19601 


1-535 


HBJAJ85


35


675904


AC002450 


19602 


1-117 


HBJAJ85


35 


675904 


AC002450 


19603 


1-894 


HBJAJ85 


35 


675904 


AC002450 


19604 


1-501 


HBJAV57 


36 


527998 


AC004976


19605 


1-140
348-434
2467-2856
3263-3461
4014-4463
4995-5067
6578-6638
7115-7236


HBJAV57


36 


527998 


AC004976 


19606 


1-273 


HBJAY91


38 


828026 


AC067953 


19607 


1-341 


HBJAY91


38 


828026 


AC067953 


19608 


1-180 


HBJAY91


38


828026 


AC067953 


19609 


1-364 


HBJBM14 


39 


781398




19610


1-407


HBJBM14 


39 


781398 


AC022239 


19611 


1-463 


HBJBM14 


39 


781398 


AC022239 


19612 


1-663 


HBJBU55 


41 


527112 


AC024631 


19613 


1-580 


HBJBU55 


41 


527112 


AC024631 


19614 


1-103


HBJCD43


42 


714390 


AC013726 


19615 


1-76 
903-1151 
6010-6078 
6161-6277 
8977-9064 
9732-9789 
10007-10050 
10340-10668 
10685-10790 

13579-13684 

1-HoJ-AHZ / / 
171 1tf-171*M 

1 / I 1 O" 1/1 J*T 


HBJCD43 


42 


714390 


AC013726 


19616 


1-181
186-409 


HBJCD43 


42 


714390 


AC013726 


19617 


1-315


HBJCJ68 


44 


823468 


AC022827 


19618 


1-223 


HBJCJ68


44 


823468 


AC025429 


19619 


1-223 


HBJCJ68 


44 


823468 


AC022827 


19620 


1-255 


HBJCJ68 


44 


823468 


AC022827 


19621 


1-331 


HBJCJ68 


44 


823468 


AC025429 


19622 


1-331 


HBJCJ68 


44 


823468 


AC025429 


19623 


1-254 


HBJDL73 


48 


531104 


AC073041 


19624 


1-80 
1609-2611 

2813-3040

3929-4450 
5011-5181 
6243-6662 
7347-7904 


HBJDL73 


48 


531104 


AC007401 


19625 


1-115 
615-834 
2847-2983 
6954-6980 
7541-7661 
9579-9723 
12147-12951 
13592-13749 
17487-18023 
23251-23354 
24659-24888 
25992-26409 
30147-30227 
31757-32759 
32961-33188 
34077-34598 
35159-35329 
36391-36810 
37495-39032 
39741-42310 

44277-44792 
46043-46323 
46392-47319 


HBJDO70 


50 


752830 


AC026134 


19626 


1-539 


HBJDO70 


50 


752830 


AC026134 


19627 


1-169


HBJDO70


50


752830


AC026134 


19628 


1-421 


HBJDP32


51


573846


AC046176


19629


1-2934 


HBJDP32 


51 


573846 


AC046176 


19630 


1-357 


HBJDP41 


52 


494836 


AC015764 


19631 


1-90
1233-1633
2222-2584
3024-3326
3819-3944
4361-4505


HBJDP41 


52 


494836 


AC011667 


19632 


1-138
1281-1681
2270-2632
3070-3372
3865-3990
4407-4644
8378-8745
8970-10143
10599-10746
11435-11585
11809-12440
12785-14150


HBJDP41


52


494836 


AC012052 


19633 


1-401 


HBJDP41


52


494836 


AC012052 


19634 


1-363 


tjtx TTYTM'7 


55 


573728 


AC016555


19635


1-330 


xIdJU 14/ 


55


573728 


AC010243 


19636 


1-330 


ITDTTYIM7 


55 


573728 


AC010897


19637 


1-330


HBJDW36


57 


573847 


AC023933 


19638 


1-460 


HBJDW36 


57 


573847 


AC023933 


19639 


1-231 


HBJDW36 


57 


573847 


AC023933 


19640 


1-311


HBJDX18 


58 


795732 


AC007546 


19641 


1-103
1091-1218
1856-2074
2112-2398
2749-2937
4357-4400
5823-5913
6287-6977
7105-7443
8618-8818
9074-9299
10264-12381
12541-13026
13249-13333
13355-14023
14447-14788
14908-15500
15562-16612
17335-17834
18610-18726
21062-21312
21764-21927
24230-24297
25325-27507
27761-28166
28198-28323